WO 00/05414 PCT/US99/164n 

-45- 

WHAT IS CLAIMED IS: 

1 . A high-throughput metliod for determining a biochemical function of a protein 
or polypeptide domain of unknown function comprising: 

(A) identifying a putative polypeptide domain that properly folds into a 

5 stable polypeptide domain, said stable polypeptide having a defined three 

dimensional structure: 

(B) determining three dimensional structure of the stable polypeptide domain 
from an automated analysis of NMR spectometer spectra of said 
polypeptide domain, wherein said automated analysis is conducted by a 

10 NOES Y^Assign process; 

(C) comparing the determined three dimensional structure of the stable 
polypeptide domain to known three-dimensional structures in a protein 
data bank, wherein said comparison identifies known structures within 
said protein data bank iliat are homologous to the determined three 

15 dimensional structure: and 

(D) correlating a biochemical function corresponding to the identified 
homologous structure to a biochemical function for the stable 
polypeptide domain. 

2. The method according t?>"claim 1 . further comprising the prestep of parsing a 
20 target polynucleotide into at leJtst one putative polypeptide domain. 

3. The method according i^>5l^im2. wherein said parsing is performed by a first 
computer algorithm, wherein said first computer algorithm is selected from the 
group consisting of a computer algoritlim capable of determining exon phase 
boundaries of a polynucleotide, and a computer algorithm capable of 

25 determining interdomain boundaries encoded in a polynucleotide. 

4. The method of"el;mn 3. further comprising a computer algorithm that compares 
the putative polypeptiBeaomain sequence with known domain sequences stored 
within a database. 

5. The method^fclaim 1 . wherein said NMR spectra are analyzed by a second 
30 computer algorithixrthat automatically assigns resonance assignments to the 

polypeptide sequence. 
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6. The method of claim^l, wherein said identification of said stable polypeptide 
domain comprises measuring a time course of amide hydrogen-deuterium 
exchange. 

7. Tlie method of claim I , wherein prior to step (B), said stable polypeptide domain 
5 is optimally soluBiKz^d-^said optimum solubilization comprising: 

i) preparing an array of microdialysis buttons, wherein each of said 
microdialysis buttons contains at least 1 ^il of an approximately ImM 
solution of said stable polypeptide domain; 

ii) dialyzing each member of said array of microdialysis buttons against a 
10 different dialysis buffer: 

iii) analyzing each of said dialyzed microdialysis buttons to determine 
whether said stable polypeptide domain has remained soluble; and 

iv) selecting the polypeptide domain having optimum solubilit}' 
characteristics for NMR spectroscopy. 



15 8. 



The methodoT&laiiji^ wherein said comparison of said determined three 
dimensional structure fo said known three-dimensional structures in the protein 
data bank is performed by a third computer algorithm that is capable of 
determining 3D structure homology between said determined three dimensional 
structure and a member of said PDB. 

20 9. The method according^to^clbi^^^ wherein said third computer algorithm is 
selected from the group consisiingW D ALL CATH and VAST. 



10. The method of claim 1. wherein said protein data bank is Protein Data Base 
("PDBn. 



1 1 . The method of''d;iim^4ovherein said database contains domain sequence 
25 information of known and determined domain sequences. 

12. An integrated system for rapid determination of a biochemical function of a 
protein or protein domain of unknown function: 

(A) a first computer algorithm capable of parsing said target polynucleotide 
into at least one putative domain encoding region; 
30 (B) a designated lab for expressing said putative domain; 
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(C) an NMR spectrometer for determining individual spin resonances of 
amino acids of said putative domain; 

(D) a data collection device capable of collecting NMR spectral data, 
wherein said data collection device is operatively coupled to said NMR 

5 spectrometen 

(E) at least one computer: 

(F) a second computer algorithm capable of assigning individual spin 
resonances to individual amino acids of a polypeptide: 

(G) a third computer algorithm capable of determining tertiary structure of a 
1 0 polypeptide, wherein said polypeptide has had resonances assigned to 

individual amino acids of said polypeptide; 

(H) a database, wherein stored within said database is infonnation about the 
structure and function of known proteins and determined proteins; and 

(I) a fourth computer algorithm capable of determining 3D structure 

1 5 homology between the determined three-dimensional structure of a 

polypeptide of unknown function to three-dimensional structure of a 
protein of known function, wherein said protein of known structure is 
stored within said protein database, wherein said fourth computer 
algorithm determines said structure by an automated NOESY_Assign 

20 process. 

13. A high-throughput method for determining a biochemical function of a 

polypeptide of unknown function encoded by a target polynucleotide comprising 
the steps: 

(A) identifying at least one putative polypeptide domain encoding region of 
25 the target polynucleotide (''parsing"); 

(B) expressing said putative polypeptide domain; 

(C) determining whether said expressed putative polypeptide domain forms a 
stable polypeptide domain having a defined three dimensional structure 
(^^trapping**); 

30 (D) determining the three dimensional structure of the stable polypeptide 

domain by an automated NOES Y_Assign process; 
(E) comparing the determined three dimensional structure of the stable 

polypeptide domain to knovm three dimensional structures in a Protein 
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Data Bank lo determine whether any such known structures are 
homologous lo the determined structure: and 
correlating a biochemical ftmction corresponding to the homologous 
structure to a biochemical function for the stable polypeptide domain. 
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